Search CORE

132 research outputs found

Nucleosome DNA sequence structure of isochores

Author: AE Rapoport
AE Vinogradov
CE Shannon
DA Denisov
Edward N Trifonov
EN Trifonov
EN Trifonov
EN Trifonov
EN Trifonov
EN Trifonov
F Salih
G Bernardi
G Mengeritsky
HR Chung
I Gabdank
I Gabdank
M Costantini
M Costantini
M Costantini
M Costantini
M Kato
S Kogan
T Bettecken
Thomas Bettecken
VB Zhurkin
VB Zhurkin
Zakharia M Frenkel
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Significant differences in G+C content between different isochore types suggest that the nucleosome positioning patterns in DNA of the isochores should be different as well. Results Extraction of the patterns from the isochore DNA sequences by Shannon N-gram extension reveals that while the general motif YRRRRRYYYYYR is characteristic for all isochore types, the dominant positioning patterns of the isochores vary between TAAAAATTTTTA and CGGGGGCCCCCG due to the large differences in G+C composition. This is observed in human, mouse and chicken isochores, demonstrating that the variations of the positioning patterns are largely G+C dependent rather than species-specific. The species-specificity of nucleosome positioning patterns is revealed by dinucleotide periodicity analyses in isochore sequences. While human sequences are showing CG periodicity, chicken isochores display AG (CT) periodicity. Mouse isochores show very weak CG periodicity only. Conclusions Nucleosome positioning pattern as revealed by Shannon N-gram extension is strongly dependent on G+C content and different in different isochores. Species-specificity of the pattern is subtle. It is reflected in the choice of preferentially periodical dinucleotides.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Repertoires of the Nucleosome-Positioning Dinucleotides

Author: A Bolshoy
A Krueger
A Thastrom
AB Cohanim
AB Cohanim
C Davey
Cathal Seoighe
CS Davey
E Segal
Edward N. Trifonov
EN Trifonov
EN Trifonov
EN Trifonov
EN Trifonov
F Salih
F Salih
G Mengeritsky
H Herzel
I Gabdank
LE Ulanovsky
M Costantini
M Kato
S Pennings
SB Kogan
TA Manolio
Thomas Bettecken
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

It is generally accepted that the organization of eukaryotic DNA into chromatin is strongly governed by a code inherent in the genomic DNA sequence. This code, as well as other codes, is superposed on the triplets coding for amino acids. The history of the chromatin code started three decades ago with the discovery of the periodic appearance of certain dinucleotides, with AA/TT and RR/YY giving the strongest signals, all with a period of 10.4 bases. Every base-pair stack in the DNA duplex has specific deformation properties, thus favoring DNA bending in a specific direction. The appearance of the corresponding dinucleotide at the distance 10.4 xn bases will facilitate DNA bending in that direction, which corresponds to the minimum energy of DNA folding in the nucleosome. We have analyzed the periodic appearances of all 16 dinucleotides in the genomes of thirteen different eukaryotic organisms. Our data show that a large variety of dinucleotides (if not all) are, apparently, contributing to the nucleosome positioning code. The choice of the periodical dinucleotides differs considerably from one organism to another. Among other 10.4 base periodicities, a strong and very regular 10.4 base signal was observed for CG dinucleotides in the genome of the honey bee A. mellifera. Also, the dinucleotide CG appears as the only periodical component in the human genome. This observation seems especially relevant since CpG methylation is well known to modulate chromatin packing and regularity. Thus, the selection of the dinucleotides contributing to the chromatin code is species specific, and may differ from region to region, depending on the sequence context

Public Library of Science (PLOS)

Crossref

PubMed Central

MPG.PuRe

Skittle: A 2-Dimensional Genome Visualization Tool

Author: Birney
D Sussillo
E Lieberman-Aiden
EN Trifonov
EN Trifonov
G Benson
GM Weinstock
GS Baldwin
I López-Villaseñor
J Sánchez
JF Canny
John C Sanford
Josiah D Seaman
M Costantini
MB Gerstein
MK Rudd
P Schieg
S Kurtz
X She
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background It is increasingly evident that there are multiple and overlapping patterns within the genome, and that these patterns contain different types of information - regarding both genome function and genome history. In order to discover additional genomic patterns which may have biological significance, novel strategies are required. To partially address this need, we introduce a new data visualization tool entitled Skittle. Results This program first creates a 2-dimensional nucleotide display by assigning four colors to the four nucleotides, and then text-wraps to a user adjustable width. This nucleotide display is accompanied by a "repeat map" which comprehensively displays all local repeating units, based upon analysis of all possible local alignments. Skittle includes a smooth-zooming interface which allows the user to analyze genomic patterns at any scale. Skittle is especially useful in identifying and analyzing tandem repeats, including repeats not normally detectable by other methods. However, Skittle is also more generally useful for analysis of any genomic data, allowing users to correlate published annotations and observable visual patterns, and allowing for sequence and construct quality control. Conclusions Preliminary observations using Skittle reveal intriguing genomic patterns not otherwise obvious, including structured variations inside tandem repeats. The striking visual patterns revealed by Skittle appear to be useful for hypothesis development, and have already led the authors to theorize that imperfect tandem repeats could act as information carriers, and may form tertiary structures within the interphase nucleus.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A model-independent approach to infer hierarchical codon substitution dynamics

Author: A Jiménez-Sánchez
AS Novozhilov
C Kosiol
C Kosiol
CR Woese
CR Woese
DG Hwang
DS Riddle
DT Jones
E Trifonov
EN Trifonov
EN Trifonov
EN Trifonov
FH Crick
GH Gonnet
HA Simon
JEM Hornos
JG Kemeny
JR Jungck
JTF Wong
M Di Giulio
M Meilă
MA Jiménez-Montano
MA Jiménez-Montaño
Martin Nilsson Jacobi
MN Jacobi
MO Dayhoff
MS Johnson
MW Nirenberg
O Görnerup
O R
Olof Görnerup
R Marquez
S Itzkovitz
S Whelan
SD Copley
T Bollenbach
T Wilhelm
TD Wu
V Karasev
VR Chechetkin
W Taylor
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Codon substitution constitutes a fundamental process in molecular biology that has been studied extensively. However, prior studies rely on various assumptions, e.g. regarding the relevance of specific biochemical properties, or on conservation criteria for defining substitution groups. Ideally, one would instead like to analyze the substitution process in terms of raw dynamics, independently of underlying system specifics. In this paper we propose a method for doing this by identifying groups of codons and amino acids such that these groups imply closed dynamics. The approach relies on recently developed spectral and agglomerative techniques for identifying hierarchical organization in dynamical systems. Results We have applied the techniques on an empirically derived Markov model of the codon substitution process that is provided in the literature. Without system specific knowledge of the substitution process, the techniques manage to "blindly" identify multiple levels of dynamics; from amino acid substitutions (via the standard genetic code) to higher order dynamics on the level of amino acid groups. We hypothesize that the acquired groups reflect earlier versions of the genetic code. Conclusions The results demonstrate the applicability of the techniques. Due to their generality, we believe that they can be used to coarse grain and identify hierarchical organization in a broad range of other biological systems and processes, such as protein interaction networks, genetic regulatory networks and food webs.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Chalmers Research

Chalmers Publication Library

Discrete wavelet transform de-noising in eukaryotic gene splicing

Author: AS Nair
D Anastassiou
EN Trifonov
JG Proakis
KP Soman
PP Vaidyanathan
R Kakumani
S Tiwari
Tessamma Thomas
Tina P George
TW Fox
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background This paper compares the most common digital signal processing methods of exon prediction in eukaryotes, and also proposes a technique for noise suppression in exon prediction. The specimen used here which has relevance in medical research, has been taken from the public genomic database - GenBank. Methods Here exon prediction has been done using the digital signal processing methods viz. binary method, EIIP (electron-ion interaction psuedopotential) method and filter methods. Under filter method two filter designs, and two approaches using these two designs have been tried. The discrete wavelet transform has been used for de-noising of the exon plots. Results Results of exon prediction based on the methods mentioned above, which give values closest to the ones found in the NCBI database are given here. The exon plot de-noised using discrete wavelet transform is also given. Conclusion Alterations to the proven methods as done by the authors, improves performance of exon prediction algorithms. Also it has been proven that the discrete wavelet transform is an effective tool for de-noising which can be used with exon prediction algorithms.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Mutability and Evolvability: Indirect Selection for Mutability

Author: D G King
EN Trifonov
G Bell
G Williams
M Petrie
P Capy
PD Sniegowski
RA Fisher
S Cotton
T Bataillon
Y Kashi
Y Kashi
Publication venue: OpenSIUC
Publication date: 01/08/2007
Field of study

Crossref

OpenSIUC

Predicting Human Nucleosome Occupancy from Primary Sequence

Nucleosomes are the fundamental repeating unit of chromatin and comprise the structural building blocks of the living eukaryotic genome. Micrococcal nuclease (MNase) has long been used to delineate nucleosomal organization. Microarray-based nucleosome mapping experiments in yeast chromatin have revealed regularly-spaced translational phasing of nucleosomes. These data have been used to train computational models of sequence-directed nuclesosome positioning, which have identified ubiquitous strong intrinsic nucleosome positioning signals. Here, we successfully apply this approach to nucleosome positioning experiments from human chromatin. The predictions made by the human-trained and yeast-trained models are strongly correlated, suggesting a shared mechanism for sequence-based determination of nucleosome occupancy. In addition, we observed striking complementarity between classifiers trained on experimental data from weakly versus heavily digested MNase samples. In the former case, the resulting model accurately identifies nucleosome-forming sequences; in the latter, the classifier excels at identifying nucleosome-free regions. Using this model we are able to identify several characteristics of nucleosome-forming and nucleosome-disfavoring sequences. First, by combining results from each classifier applied de novo across the human ENCODE regions, the classifier reveals distinct sequence composition and periodicity features of nucleosome-forming and nucleosome-disfavoring sequences. Short runs of dinucleotide repeat appear as a hallmark of nucleosome-disfavoring sequences, while nucleosome-forming sequences contain short periodic runs of GC base pairs. Second, we show that nucleosome phasing is most frequently predicted flanking nucleosome-free regions. The results suggest that the major mechanism of nucleosome positioning in vivo is boundary-event-driven and affirm the classical statistical positioning theory of nucleosome organization

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

PerPlot & PerScan: tools for analysis of DNA curvature-related periodicity in genomic nucleotide sequences

Author: A Bolshoy
A Fire
A Theologis
C Jacq
CJ Bult
E Segal
EN Trifonov
H Herzel
H Herzel
H Willenbrock
J Mrázek
J Mrázek
J Mrázek
L Kozobay-Avraham
LE Ulanovsky
MY Tolstorukov
P Schieg
P Worning
R Kiyama
R Rohs
RD Fleischmann
SG Gu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

On the Evolution of the Standard Genetic Code: Vestiges of Critical Scale Invariance from the RNA World in Current Prokaryote Genomes

Author: A Arneodo
BB Mandelbrot
BB Mandelbrot
BJ West
BJ West
C Guerrier-Takada
C Woese
C Woese
C.-K Peng
CJ Michel
CJ Michel
CR Woese
D Arquès
D Sornette
DG Arquès
DJ Kenneth
E Szathmáry
EN Trifonov
EN Trifonov
F Jacob
FHC Crick
FHC Crick
G Eriani
G Frey
GF Joyce
GM Nagel
H Herzel
HB Nicholas
I López-Villaseñor
J Konecny
J Maynard-Smith
JA García
JA García
JB Bassingthwaighte
JC Shepherd
JCW Shepherd
JCW Shepherd
JCW Shepherd
JEM Hornos
José A. García
JT Trevors
JTze-Fei Wong
Juan R. Bobadilla
K Kruger
K Wilson
L Ribas de Pouplana
LE Orgel
M Balter
M Delarue
M Eigen
M Eigen
M Eigen
M Eigen
M Eigen
M Eigen
Marco V. José
Mukund Thattai
MV José
MV José
N Paul
P Bernáola-Galván
P Nissen
PP Amaral
R Jolivet
R Sánchez
RD Knight
SJ Freeland
SN Rodin
SV Buldyrev
TH Jukes
Tzipe Govezensky
W Gilbert
WK Johnston
Publication venue: Public Library of Science
Publication date: 02/02/2009
Field of study

Herein two genetic codes from which the primeval RNA code could have originated the standard genetic code (SGC) are derived. One of them, called extended RNA code type I, consists of all codons of the type RNY (purine-any base-pyrimidine) plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. In order to test if putative nucleotide sequences in the RNA World and in both extended RNA codes, share the same scaling and statistical properties to those encountered in current prokaryotes, we used the genomes of four Eubacteria and three Archaeas. For each prokaryote, we obtained their respective genomes obeying the RNA code or the extended RNA codes types I and II. In each case, we estimated the scaling properties of triplet sequences via a renormalization group approach, and we calculated the frequency distributions of distances for each codon. Remarkably, the scaling properties of the distance series of some codons from the RNA code and most codons from both extended RNA codes turned out to be identical or very close to the scaling properties of codons of the SGC. To test for the robustness of these results, we show, via computer simulation experiments, that random mutations of current genomes, at the rates of 10−10 per site per year during three billions of years, were not enough for destroying the observed patterns. Therefore, we conclude that most current prokaryotes may still contain relics of the primeval RNA World and that both extended RNA codes may well represent two plausible evolutionary paths between the RNA code and the current SGC

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Physical properties of naked DNA influence nucleosome positioning and correlate with transcription start and termination sites in yeast

Abstract Background In eukaryotic organisms, DNA is packaged into chromatin structure, where most of DNA is wrapped into nucleosomes. DNA compaction and nucleosome positioning have clear functional implications, since they modulate the accessibility of genomic regions to regulatory proteins. Despite the intensive research effort focused in this area, the rules defining nucleosome positioning and the location of DNA regulatory regions still remain elusive. Results Naked (histone-free) and nucleosomal DNA from yeast were digested by microccocal nuclease (MNase) and sequenced genome-wide. MNase cutting preferences were determined for both naked and nucleosomal DNAs. Integration of their sequencing profiles with DNA conformational descriptors derived from atomistic molecular dynamic simulations enabled us to extract the physical properties of DNA on a genomic scale and to correlate them with chromatin structure and gene regulation. The local structure of DNA around regulatory regions was found to be unusually flexible and to display a unique pattern of nucleosome positioning. Ab initio physical descriptors derived from molecular dynamics were used to develop a computational method that accurately predicts nucleosome enriched and depleted regions. Conclusions Our experimental and computational analyses jointly demonstrate a clear correlation between sequence-dependent physical properties of naked DNA and regulatory signals in the chromatin structure. These results demonstrate that nucleosome positioning around TSS (Transcription Start Site) and TTS (Transcription Termination Site) (at least in yeast) is strongly dependent on DNA physical properties, which can define a basal regulatory mechanism of gene expression

Crossref

Springer

Springer - Publisher Connector

PubMed Central

eScholarship - University of California